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ABSTRACT 



A method, apparatus and computer program product for 
reducing the data transmitted over a communication link 
from a first application resident in a first computer and to a 
second application resident in a second computer wherein 
the data is transmitted over an external communication link. 
HTML data in response to specific data in a CGI request is 
stored in a cache resident in the first computer to create a 
server base cache entry. HTML data in response to the 
specific data in the CGI request is also stored in a cache 
resident in the second computer to create a client base cache 
entry. CGI requests and specific CGI data is evaluated to 
determine if a corresponding client base cache entry exists 
to provide a client base form. CGI requests and specific CGI 
data are interrogated to determine if a server base cache 
entry exists to provide a server base form. The data stream 
corresponding to the response HTML data is intercepted and 
compared to the server base form to provide difference data. 
The difference data is then sent to the second computer over 
the external communication link. The response HTML data 
is reconstructed from the client/server specific data stream 
received over the external communication link by combin- 
ing the client base form with the difference data and pro- 
vided to the second application. Creation of an extended 
cache key corresponding to the CGI form and the user input 
data is also provided. ^ _ 

18 Claims, 7 Drawing Sheets 
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CGI RESPONSE DIFFERENCING 
COMMUNICATION SYSTEM 

RELATED APPLICATIONS 

This application is a continuation-in-part (CIP) of appli- S 
cation Ser. No. 601,804, filed Feb. 15, 1996, now U.S. Pat. 
No. 5,754,774. 

FIELD OF THE INVENTION 

10 

The present invention relates to communications between 
client server applications such as a web browser and a web 
server. More particularly the present invention relates to 
communications over a low-speed or wireless communica- 
tion link between two computers, one running a client 15 
application and the other running a server application. 

BACKGROUND OF THE INVENTION 

The recent publicity and emphasis on the "information 
superhighway" has increased awareness and acceptance of 20 
the Internet as a mass communication media. This broad 
based recognition of the Internet as a viable media for 
communication and interaction across multiple networks has 
also created a large established user base built upon the 
Internet standardized protocols for interaction between com- 25 
puter networks. 

The paradigm for the Internet is that of a client-server 
relationship where Internet clients (browsers) communicate 
with Internet servers. To provide greater access to the 3Q 
Internet the communication protocols and languages utilized 
by the clients and servers have become standardized. These 
protocols include the Hyper-Text Transfer Protocol (HTTP), 
which is the communication protocol used for communica- 
tions between clients and servers, and the Transfer Control 35 
Protocol/Internet Protocol (TCP/IP) the TCP portion of 
which is the transport specific protocol for communication 
between computers or applications. Also standardized is the 
language in which clients and servers communicate which is 
called Hyper-Text Markup Language (HTML). Because 40 
these protocols and language are machine independent, and 
utilize a connection-less best-efforts protocol to sending 
information, each transaction is fully self contained. Thus, 
for example, each message from a client contains informa- 
tion about the capabilities of the browser and is independent 45 
of any other communications for the communication to be 
completed. This self-contained nature of the communica- 
tions between a client and a server may be referred to as 
"stateless" communications and increases the amount of 
data which must be transferred between a client and a server 50 
for a given communication. 

In the context of the World Wide Web client/server 
applications the client may be a web browser which acts as 
the user interface. The web browser sends user requests to 
the appropriate web server and formats and displays the 55 
HTML data returned from the web server. The web browser 
also evaluates the HTML data to determine if there are any 
embedded hyper-fink statements in the HTML data which 
would require subsequent browser requests which would 
then be initiated by the browser. A web server acts as the 60 
server for the client and processes the web browsers requests 
and returns the requested response as an HTML data portion 
of a HTTP data stream. 

As an example of a typical world wide web communica- 
tion the case of a web browser initiating a request for a 65 
"home page" from the web server illustrates the basic 
relationship between HTTP, HTML, TCP and the web 
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browser and server. When the user of the web browser 
requests information from a specific web site, the web 
browser initiates communication with the web server by 
sending a "get" request to the web server specifying the 
Uniform Resource Locator (URL) of the desired web site 
which, for purposes of this example, may be a "home page/' 
The URL acts as the address of the web site and is unique 
throughout the Internet. The web server would then obtain 
and supply the web browser with the HTML data corre- 
sponding to the home page specified by the URL. This 
operation may involve further communications on the Inter- 
net by the Internet web server or the URL may specify the 
server which is in the local network to which the browser is 
attached. The web browser would then evaluate the HTML 
data received as an HTTP data stream from the web server 
to see if there were any embedded hyper-links such as an 
icon or an image and, if such a byper-link exists would 
initiate requests specifying the URL of the hyper-fink to 
obtain the specified data. This data would then be incorpo- 
rated into the home page and displayed to the user. As is seen 
in this simple example, a single user input request by a web 
browser may result in multiple additional requests which are 
automatically carried out by the web browser in response to 
the receipt of the HTML data corresponding to the user input 
request. 

The basic communication structure for an Internet based 
system is depicted in FIG. 1. In FIG. 1 a web browser 10 
communicates with a web server 20 over a communication 
link 15. This communication link is typically a local area 
network connection, wide area network connection, a con- 
nection over telephone lines or a combination thereof. The 
web browser 10 communicates with the web server 20 using 
TCP/IP. For the majority of Internet communications a web 
browser communicates with a web server using the generic 
communication protocol HTTP which is transmitted 
between the web browser and the web server over the 
TCP/IP link between the web browser and the web server. 
The actual data transferred between the web browser 10 and 
the web server 20 are HTTP data objects (e.g. HTML data) 
as described above. The web server 20 may be a proxy 
which receives web browser communications from a num- 
ber of web browsers and routes them to the appropriate 
server. 

The popularity of the web browser/web server and their 
common information and transport protocols, HTML and 
HTTP, has lead to rapid acceptance of web technology as a 
universal interface for network access to information. 
Furthermore, because the protocols and language for com- 
munication between web browsers and web servers are 
standardized the communication protocols and language 
will be the same whether a user is using Netscape 
Navigator™, NCSA Mosaic™, WebExplorer™ or any other 
web browser as their web browser to access network infor- 
mation. Therefore, the large installed user base for web 
browsers combined with the connectivity of the Internet and 
the ease of writing web application servers using the HTTP 
defined Common Gateway Interface (CGI)make web tech- 
nology very attractive for a large class of forms-based 
applications. 

At the same time that the Internet was growing in popu- 
larity and acceptance, mobile computing was also increasing 
in popularity. The use of laptops, notebooks, Personal 
Digital/Communication Assistants (PDAs/PCAs) and other 
portable devices has lead to an increase in demands for 
wireless communications. Wireless wide area networks, 
cellular communications and packet radio, however, suffer 
from common limitations if used in a web context. The high 
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cost per byte of communications, slow response time, low 
bandwidth and unreliability all hamper use of wireless 
technology for the stateless communication protocol of the 
World Wide Web. Also, because the web protocol is stateless 
the amount of data per request and the number of commu- 
nication requests transferred over the wireless connection 
are larger than would be necessary if the communication 
were not self contained. Thus, combining wireless 
technology, or any low-speed communication technology, 
with web technology seems impractical as the strength of the 
web technology in its universal nature exacerbates the 
weaknesses of the wireless technology. 

OBJECTS AND SUMMARY OF THE 
INVENTION 

In view of the above limitations it is one object of the 
present invention to provide a communication system which 
reduces the amount of data to be transferred between appli- 
cations. 

It is an additional object of the present invention to 
provide a communication system which may be used in a 
web browser/server environment. 

It is a further object of the present invention to be 
compatible with existing communication protocols and lan- 
guages in a low speed or wireless communication system 
without requiring modification of web browser or web 
server applications. 

It is an additional object of the present invention to 
provide a communication system which reduces the amount 
of communication required between a web browser and a 
web server and thereby enhances performance of the com- 
munication system. 

In view of these and other objects, the present invention 
provides for reducing the data transmitted over a commu- 
nication link from a first application resident in a first 
computer and to a second application resident in a second 
computer wherein the data is transmitted over an external 
communication link from the first computer to the second 
computer Aspects of the present invention include storing 
HTML data from the first application to be provided to the 
second application in response to specific data in a CGI 
request from the second application in a cache resident in the 
first computer to create a server base cache entry. HTML 
data to be provided to the second application in response to 
the specific data in the CGI request from the second appli- 
cation is also stored in a cache resident in the second 
computer to create a client base cache entry. CGI requests 
and specific data associated with the CGI requests from the 
second application is evaluated to determine if a client base 
cache entry corresponding to the evaluated CGI request and 
specific data exists to provide a client base form. CGI 
requests and specific data associated with the CGI requests 
from the second application are interrogated to determine if 
a server base cache entry corresponding to the interrogated 
CGI request and specific data exists to provide a server base 
form. The data stream corresponding to the response HTML 
data from the first application in response to the interrogated 
CGI request and the specific data from the second applica- 
tion is intercepted prior to transmission of the response 
HTML data on the external communication link. The inter- 
cepted response HTML data is then compared to the server 
base form to provide difference data corresponding to the 
difference between the intercepted response and the server 
base form. The difference data is then sent to the second 
computer over the external communication link. The differ- 
ence data transmitted over the external communication link 
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sent is acquired by the first computer and the response 
HTML data corresponding to the communication from the 
first application is reconstructed from the client/server spe- 
cific data stream received over the external communication 

5 link by combining the client base form with the difference 
data received over the external communication link to create 
a response data stream corresponding to the intercepted 
response. The reconstructed data stream corresponding to 
the intercepted response HTML data is then provided to the 

to second application. 

By providing for differencing of responses to CGI 
requests, the present invention may reduce the amount of 
data transferred over an external communication link, such 
as a wireless link. Because responses to CGI requests may 

15 vary greatly in their data, caching multiple base forms for 
differencing allows for a closer match between the response 
data and the base form which results in less data. The use of 
the extended cache key based on the URL and the specific 
input data allows for the creation of the multiple base forms 

20 for a URL. 

In a further embodiment of the present invention, the 
HTML data from the first application stored to create a 
server base cache entry and the HTML data to be provided 
to the second application stored to create a client base cache 

25 entry are stored with the Uniform Resource Locator (URL) 
of the CGI request and the specific data of the CGI request 
as a key to the cache entries. In such a case, the evaluation 
of CGI requests and specific data may include determining 
if a client base cache entry has a URL and specific data 

30 corresponding to the evaluated CGI request and the inter- 
rogation of CGI requests and specific data may include 
determining if a server base cache entry has a URL and 
specific data corresponding to the interrogated CGI request. 

35 In another aspect of the present invention, a client base 
form is selected with the closest match to the URL and 
specific data. Also, then a server base form is selected which 
provides the closest match to the URL and specific data. 
In another embodiment of the present invention, the 

40 server and client cache entries are only stored if the number 
of client cache entries and server cache entries correspond- 
ing to the CGI request is less than a predetermined maxi- 
mum entry value. 

In still another embodiment of the present invention, it is 

45 determined if the server base form is identical to the client 
base form. The server base form and the difference data are 
then transmitted to the second computer over the external 
communication link if the server base form is not identical 
to the client base form. Then the intercepted response data 

50 stream corresponding to the response from the first appli- 
cation is reconstructed by combining the server base form 
received over the external communication link with the 
difference data received over the external communication 
link to create a data stream corresponding to the intercepted 

5 S response. The client base form corresponding to the inter- 
rogated request is also updated by storing the received server 
base form as the client base cache entry corresponding to the 
interrogated request 
In particular embodiments of the present invention, the 

60 first application comprises a web server and the second 
application comprises a web browser. Furthermore, the 
external communication link may be a wireless communi- 
cation link. 

In another embodiment of the present invention a base 
65 form for a CGI response differencing communication system 
is generated by storing the input elements of a CGI input 
form and storing the parameters of a request using the CGI 
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input form. When the response to the request using the CGI 
input form is received, an extended cache key is generated 
from the URL of the CGI request, the input elements of the 
CGI input form and the parameters of the CGI request using 
the CGI input form. The response to the request is then 
stored as an entry in a cache using the extended cache key 
as a key to the cache entry. The extended cache key may be 
generated by generating an extension to a URL of the CGI 
request including non-default parameters of the request to 
provide an extension string and appending the extension 
string to the URL of the CGI request. 

In a further aspect of the present invention, input param- 
eters having a set of possible values greater than a predefined 
value are excluded from the extension string. Text element 
values may also be excluded from the extension string. 

In further aspects of the present invention, difference data 
between an entry in the cache having a first extended cache 
key and a subsequent response to a CGI request having the 
same extended cache key is generated and transmitted over 
an external communication link. Furthermore, a CGI 
response may be reconstructed from difference data associ- 
ated with the extended cache key and the cache entry 
associated with the extended cache key. 

As will be appreciated by those of skill in this art, the 
above described aspects of the present invention may also be 
provided as an apparatus or a program product having 
computer-readable program means. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram of a typical web browser/web 
server system; 

FIG. 2 is a block diagram of a web browser/web server 
system according to one embodiment of the present inven- 
tion utilizing a client intercept and a server intercept; 

FIG. 3 is a flow diagram depicting operations carried out 
by both the server-side and client-side intercept modules in 
a preferred embodiment of the present invention; 

FIG. 4 is a flow diagram of the creation of an extended 
URL according to the present invention; 

FIG. 5 is a flow diagram depicting operations associated 
with receiving a request from a browser carried out by a 
client-side intercept module in a preferred embodiment of 
the present invention implementing a differencing data 
transfer system; 

FIG. 6 is a flow diagram depicting operations associated 
with receiving a response from a server-side intercept mod- 
ule carried out by a client-side intercept module in a pre- 
ferred embodiment of the present invention implementing a 
differencing data transfer system; 

FIG, 7 is a flow diagram depicting operations associated 
with receiving a request from a client -side intercept module 
carried out by a server-side intercept module in a preferred 
embodiment of the present invention implementing a dif- 
ferencing data transfer system; and 

FIG. 8 is a flow diagram depicting operations associated 
with receiving a response carried out by a server-side 
intercept module in a preferred embodiment of the present 
invention implementing a differencing data transfer system. 

DETAILED DESCRIPTION 

The present invention now will be described more fully 
hereinafter with reference to the accompanying drawings, in 
which preferred embodiments of the invention are shown. 
This invention may, however, be embodied in many different 
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forms and should not be construed as limited to the embodi- 
ments set forth herein; rather, these embodiments are pro- 
vided so that this disclosure will be thorough and complete, 
and will fully convey the scope of the invention to those 
skilled in the art. Like numbers refer to like elements 
throughout. 

FIGS. 3 to 8 are flowchart illustrations of methods and 
systems according to the inventioa It will be understood that 
each block of the flowchart illustrations, and combinations 
of blocks in the flowchart illustrations, can be implemented 
by computer program instructions. These computer program 
instructions may be loaded onto a computer or other pro- 
grammable apparatus to produce a machine, such that the 
instructions which execute on the computer or other pro- 
grammable apparatus create means for implementing the 
functions specified in the flowchart block or blocks. These 
computer program instructions may also be stored in a 
computer-readable memory that can direct a computer or 
other programmable apparatus to function in a particular 
manner, such that the instructions stored in the computer- 
readable memory produce an article of manufacture includ- 
ing instruction means which implement the function speci- 
fied in the flowchart block or blocks. The computer program 
instructions may also be loaded onto a computer or other 
programmable apparatus to cause a series of operational 
steps to be performed on the computer or other program- 
mable apparatus to produce a computer implemented pro- 
cess such that the instructions which execute on the com- 
puter or other programmable apparatus provide steps for 
implementing the functions specified in the flowchart block 
or blocks. 

Accordingly, blocks of the flowchart illustrations support 
combinations of means for performing the specified func- 
tions and combinations of steps for performing the specified 
functions. It will also be understood that each block of the 
flowchart illustrations, and combinations of blocks in the 
flowchart illustrations, can be implemented by special pur- 
pose hardware-based computer systems which perform the 
specified functions or steps, or combinations of special 
purpose hardware and computer instructions. 

FIG. 2 illustrates one embodiment of the present inven- 
tion. As seen in FIG. 2, a web browser 10 communicates 
with a client-side intercept module 30. The web server 20 
communicates with a server-side interrupt module 40. The 
client-side intercept module 30 then communicates with the 
server-side intercept module 40 over the communication 
link 35. The web browser 10 and the client-side intercept 
module 30 may be contained in a first computer 5. The 
server-side intercept module 40 and the web server 20 may 
be contained in a second computer 6. The first computer 5 
and the second computer 6 communicate over external 
communication link 35. 

Preferably, the web browser 10 is an Internet web browser 
utilizing hypertext transfer protocol (HTTP) and hypertext 
markup language (HTML) to communicate with an Internet 
web server 20 which also uses HTTP and HTML, In 
operation, the web browser 10 would output an HTTP data 
stream which is intercepted by the client-side intercept 
module 30. The intercept of the HTTP data stream by the 
client-side intercept module 30 may be accomplished 
through the use of the TCP/IP loop -back feature where the 
client side intercept module 30 resides at an IP address 
having a network number of 127, such as 127.0.0.1. The 
client-side intercept module 30 then converts or transforms 
the HTTP data stream into a client/server specific protocol 
and transmits the client/server specific data stream onto the 
external communication link 35. The server-side intercept 
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module 40 receives the client/server specific data stream and client side intercept modules would then communicate with 

reconstructs the original HTTP data stream corresponding to the server-side intercept module of the web server or web 

the web browser originated communication. This re con- proxy. 

stnicted HTTP data stream is then transferred t o the web In one embodiment of the present invention, both the 

server 20. The web server 20 responds to the HTTP data 5 client-side intercept module 30 and the server-side intercept 

stream in the normal manner of an Internet web server. As module 40 have cache storage capabilities. The client cache 

will be appreciated by one of skill in the art, the web server resident in the first computer stores HTTP data streams to be 

20 may also be a proxy which allows multiple browsers to received by the web browser in response to a web browser 

connect to the Internet. originated communication. The server cache resident in the 

When information is received by the web server 20 for 1Q second computer stores HTTP data streams which are 

transmission to the web browser 10, for example, in received from the web server in response to a browser 

response to a browser request for a specific URL home page, originated communication. 

the web server 20 outputs an HTTP data stream correspond- As will be appreciated by one of skill in the art, the cache 

ing to the communication to be sent to the web browser 10. resident in the first computer or the second computer may be 

This web server originated communication is intercepted by 15 0 f any size based upon the specific hardware configurations 

the server-side intercept module 40 and transformed to a 0 f the computers. These caches store information for each 

client/server specific data stream. The client/server specific communication including, the URL of the communication, a 

data stream corresponding to the web server originated unique identifier based on the communications contents such 

communication is then sent on the external communication as a cyclic redundancy check (CRC) of the data of the 

link 35 from the second computer to the first computer. The ^ communication, the store date time (SDT) indicating the 

client/server specific data stream is recei ved by the client- time when the cache entry was created or refreshed and the 

side intercept module 30 and the original HTTP data stream data of the communication. Thus, a directory of cache 

corresponding to the web server originated communication entries may be created for each communication stored in the 

is rebuilt and provided to the web browser 10. cache. Because of the limited resources available in any 

In a particular embodiment of the present invention, the 25 given hardware configuration, any number of caching tech- 
external communication link 35 is a wireless communication niques known to one of skill in the art for maintaining the 
link. In such a case, in order to obtain system performance caches resident in the first computer and the second com- 
which is acceptable to users, it is desirable to reduce the puter may be utilized. For example, the cache may invalidate 
amount of communication over the external communication the oldest directory entry if a user defined cache size would 
link 35 both in the frequency of the communications and in 30 be exceeded by the addition of a new entry and then the new 
the amount of information which must be transferred over entry added in place of the invalidated entry. Cache entries 
the communication link 35. Accordingly, the present inven- may also be maintained over multiple instances of the web 
tion utilizes caching, differencing, and protocol reduction browser or web server applications or even power-on cycles 
techniques to minimize the amount of communication of the first or second computers to create a persistent cache, 
required over the external communication link 35. These 35 In one embodiment of the present invention, the key or 
techniques are accomplished by converting the stateless or index into the server-side and client-side caches is the URL 
stochastic protocols of HTTP into a client/server specific of the request. However, because the URL of a CGI request 
protocol which utilizes information specific to the client and is the same for multiple responses, one aspect of the present 
the server to reduce the amount and frequency of commu- invention extends the URL for a cache entry to include the 
nications. 40 data associated with a CGI request. This extension of the 

The present invention preferably utilizes a virtual socket URL information is described in detail below. By extending 

system such as is illustrated in commonly assigned U.S. the URL to include the specific data of a request the depth 

patent application Ser. No. 08/601,804 entitled CLIENT/ of the cache with respect to a CGI URL may be increased 

SERVER COMMUNICATION SYSTEM, the disclosure of such that an increased number of cache "hits" may result for 

which is incorporated herein by reference as if set forth fully. 45 CGI requests. Thus, the amount of data transferred across 

The present invention also preferably utilizes the data reduc- the external communication link may be further reduced for 

tion techniques described in commonly assigned U.S. patent the CGI requests. However, because there may be a large 

application Ser. No. 08/601,753 entitled TIME COHERENT number of CGI requests that differ only slightly, it may be 

CACHING SYSTEM, and in commonly assigned U.S. desirable to limit the number of CGI responses that are 

patent application Ser. No. 08/601,903 entitled DIFFER- 50 cached for any on CGI URL. Such an embodiment of the 

ENCING COMMUNICATION SYSTEM, the disclosures present invention is described below, 

of which is incorporated herein by reference as if set forth The present invention utilizes an extension to a URL entry 

fully. The present invention also preferably utilizes the for maintaining multiple versions of responses to a CGI 

rebasing techniques described in commonly assigned U.S. request. In a typical CGI interaction, the response to an 

patent application Ser. No. 08/819,222 entitled METHOD 55 initial CGI request is a CGI form which contains HTML 

AND APPARATUS FOR OPTIMAL REBASING OF WEB input elements (of the form <input type-xxx . . . > or 

PAGE TRANSMISSION, the disclosure of which is incor- <select ...>... </selecl>) to specify the type of input 

porated herein by reference as if set forth fully. actions presented to the end user and, for some types, the 

While the present invention has and will be described parameter values that are to be associated with a given 

with respect to a single web browser application and a single 60 action. The type of input elements include: checkbox; radio; 

web server application, as will be appreciated by those of text; and submit Another HTML construct for specifying 

skill in this art, the benefits and advantages of the present input data is a select element, which permits the selection of 

invention may also be achieved with multiple web browsers one or more items from a list. The submit input element is 

associated with a single web server. Thus, the methods, used to signal that the request half is to be submitted. Default 

apparatus and program products of the present invention 65 values can be coded as part of the HTML input element 

may be used in connection with multiple browsers each specification. When the submit button is pressed, all the 

communicating with a client-side intercept module and these input parameter names and values are constructed into a 
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linear string and sent as part of an HTTP GET or POST parameter is a checkbox element, then the request parameter 

request to the CGI server named in the uniform resource string is searched for all keyword name/value pairs with the 

locator denned for this form. same keyword and these are placed in a temporary value list 

The extended URL described above may be computed as ( block 289 >- ^ CRPT entrv b men searched and any 

a function of the HTTP request input parameters and values, s entries in the value list are removed if they are the default 

To create the extended URL, the client-side and server-side enlr y ,de . nUfl ? d "? <*» CRPTentry (block 290). Any remain- 

intercept modules capture the input parameter information "* "?™» ™ thea t 

for a CGI form as it is passed from the web server to a web °* ens ™ T & £ 1 ^ " ^ 

, . r . . It _ 4 , « . _ . . obtained as described above. 

browser. This operation is illustrated in FIG. 3. As seen in r - . . . . . . , . . 

FIG. 3, when either intercept module receives a message 10 ., f K lhe P"»""««. * a checkbox, then it is 'determined 

from the web server, the message is inspected to determine * ^ f rameter * a F adl ° bu " 0D < bl °* 2 J}^ If ,he 

if the message is a CGI input form (block 270). This * a radK> eleme ' 5 

determination may be made, as described above, by inspect- ™ Kb f ^^T^ u*™^ 2 ^L f 

ing the HTML data for the presence of a <form> element. As * e "f/ 0 bu "° n K set , to me d f auI V^P£ ^£$1 

is seen in block 272 if a form is found, then the form's URL 15 * en * e nc , xt P aramet » 15 obtained (blocks 300 and 282). If 

and all its input elements are scanned and saved in a CGI tb * T ^° element 15 set f° ,he defaul, ' 

Request Parameter Table (CRPT) using the URL of the form " d value ™ a PP ende d to * e extension string (block 295). 

_ i™, .u. ..ui- -A. . u .u .i. The next element is then obtained as described above, 
as a key into the table. These operations occur at both the 

server-side intercept module 40 and the client-side intercept If me Parameter is not a radio element, then it is deter- 

module 30 2° mined if the parameter is a "select" element (block 296). If 

u . j r . , .. • ™™. , the parameter is not a select element, then the next element 

Having saved the form mformation in the CRPT both fc obt ^ m± ]f ^ efcr js a S6lcct ^ mea ^ 

intercept modules may use this mformation to generate the request parameter string is searched for all keyword name/ 

extended URL keys identified above which may be used as ? ■ % « *u i j j *u i j • 

, • . »i_ u * j j , . Tn. va l ue P 1115 ^th ™ same keyword and these are placed m 

a key into the cache resident on each computer. Thus. „ „„i„« /ui i/n.t\ n. mirr - 4U 

- ' , . t A , - , , TT U T r 15 a temporary value list (block 297). The CRPT entry is then 

references herein to the use of the extended URL refers to 25 J d ^ fa me va / ue list ^ if mey 

the operations described below. To create an extended URL, afe ^ defeult en ideatified m ^ CRpT ^ 

when a response from a web server is received by either 29jJ) ^ remaini > iQ ^ yalue ^ ^ ^ 

m fr^ ^' , r "^T 15 U?, ?,o°, 0k u U 5 'PP^ed to the extension string (block 299). The next 

in the CRPT to see tf a form associated with the URL had e £mem ^ then obtained as desc( & e \, ab(Jve 

been received. When a request is transmitted through the „«_ c A , , , . JL 

intercept modules a copy of the request is created a! each ™™ of ^ P^tcis haye been proceed the 

oc o r . m ,.ct n u:'* if 0 pddt ■ i t . . extension string contains the values to be added to the URL 

module as a request object. 11 a CKP1 entry is found, the . . , °^ , , Tinr „ . ^ , , TTriT . 

parameter string is extracted from the request object corre- ? P r °^ de ^ e eXt ^f ^ ms e f aded URL , ma * ^ 

sponding to the response. This parameter string is combined be H ^ « below to P rovide access t0 the 

with the input elements from the CRPT to generate the 35 11101 cnenl base ^ aclies - ^ 

extended URL which is used as the cache key to look for a ^ « exam P le f of mc l _ crc l aUon of me _ e ^ nded URL > 

base form in the cache. consider an input form which contains the following ele- 

In computing the extended URL, the following guidelines menls * . 1 /••_■/ 

_ . , . . ' . . 7°, . <torm action^ http://www.mybrokerxom/cgi-bin/ 

may be used in parsing the parameter stnne of the request. An , : Y ' „ J & 

n c ui r f * u j £ j j r i r 40 status.exe method= post > 

Preferably, if a parameter matches a defined default value for . r 

the parameter the parameter is ignored. If a keyword string <m P ut { W ta text " name=*usenwme" . . . > 

is passed with a null value then the parameter is also ignored <1Q P ut type»"radio" nameo"invtype ,, value="stocks 

in determining the extended URL. Finally, if an input checked" > 

element defines a parameter that can have an unlimited set 45 <input type="radio" name«=«"invtype" value="bonds"> 

of values (e.g., type=»"text"), then the request parameter <input type='* radio w name^'invtype" value="tbills , *> 

value is ignored. Including such values leads to an <input type="racuV name="invtype" value="savings ,1 > 

unbounded number of bases for the same URL which may When the input form is sent to the browser, these elements 

prove to be unmanageable for the cache. are stored as an entry in the CRPT with a key of the above 

The above preferred guidelines lead to the creating of an 50 URL - if the HTTP request generated from filling out 

extended URL by the operations illustrated in FIG. 4. As ^ form and submitting the form is: 

seen in FIG. 4, the extension to be added to the URL for use POST www.mybroker.com/cgi-bin/status.exe 

as the cache key is first nulled (block 280). Then an input name-Smith&invtype-tbills 

parameter from the request object is obtained from the The request is then sent to the server at www.mybroker.com 

request string (block 282). If the parameter is null (block 5S and the CGI program, status.exe, is executed and generates 

284) then, if there are more parameters (block 300), the next a reply page that is returned to the server-side intercept 

parameter is obtained (block 282). Otherwise, the parsing module. The server-side intercept module will check the 

ends (block 300). If the parameter is not null, then it is server cache to see if a base exists in order to perform the 

determined if the parameter is a "text" element (block 287). differencing. Thus, the extended URL is computed to access 

If the element is a text element then it is determined if the 6Q the server base cache. The request URL is used to locate the 

text element is set to the default value (block 285) and if so CRPT entry containing the input elements described above, 

the next parameter is obtained (blocks 300 and 282). If the Then using the flow illustrated in FIG. 4, the extension string 

value is not the default then the keyword of the parameter is to the URL is calculated as: 

appended to the extension string to indicate a non-default /name/invtype/tbills 

text parameter was present (block 286). 65 This results in an extended URL of: 

If the parameter is not a text element, then it is determined www.mybroker.com/cgi-bin/status.exe/name/invtype/ 

if the parameter is a "checkbox" element (block 288). If the tbills 
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This extended URL is then used as a key into the server 
cache to access the appropriate base form. A similar process 
takes place on the client side of the transmission link. 

The operation of the present invention utilizing the 
extended URL to control caches of CGI responses will now s 
be described with reference to FIGS. 5 through 8. FIG. 5, 6, 
7, and 8 reflect the operations carried out by the client-side 
intercept module 30 and the server-side intercept module 40 
in an aspect of the present invention which utilizes differ- 
encing to reduce the data transferred over the external 10 
communication link 35. As seen in FIG. 5, block 200 
illustrates the receipt by the client-side intercept module 30 
of an HTTP request from the web browser 10. As reflected 
in block 205, the client-side intercept module 30 interrogates 
the intercepted HTTP request from the web browser 10 to 15 
determine if the request is to a common gateway interface 
(CGI). If the request is not to a common gateway interface, 
then the client-side intercept module 30 passes the request to 
the server-side intercept module as discussed in the above - 
referenced related application which is illustrated by block 20 
206 of FIG. 5. 

If, however, the web browser originated communication 
corresponds to a CGI request, then the "Yes" path out of 
block 205 is taken. As reflected in block 210, the client/ 
server intercept module 30 determines if a client base cache 25 
entry exists corresponding to the HTTP data stream which 
was previously to be provided to the web browser in 
response to a corresponding CGI request. This interrogation 
of the CGI request may be accomplished by comparing the 
extended URL of the web browser originated communica- 30 
tion to the extended URLs stored in the client base cache. 

The client base cache may be initialized by storing the 
first HTTP data stream received by the client-side intercept 
module 30 which is to be provided to the web browser 10 for 
a given extended URL This base cache entry may be 35 
maintained over numerous instances or sessions of the web 
browser 10. The client base cache entries may be updated as 
reflected in FIGS. 5, 6, 7, and 8. If a client base cache entry 
exists corresponding to the extended URL for the web 
browser originated communication, then the CRC to be sent 40 
to the server-side intercept module 40 over the external 
communication link 35 is set equal to the CRC for the client 
base cache entry as reflected in block 211 of FIG. 5. If no 
client base cache entry exists, then the "No" path out of 
block 210 of FIG. 5 is taken. The client side intercept 20 then 45 
determines if the creation of a new cache entry will exceed 
the maximum number of cache entries allowed for the URL 
of the CGI request (block 207). If a new entry will not 
exceed the maximum number of allowed entries, then the 
CRC for the request to be sent over the external communi- 50 
cation link 35 to the server-side intercept module 40 is 
nulled. This operation is reflected in block 212 of FIG. 5. 

If, however, the creation of a new entry will exceed the 
maximum number of entries allowed, then the base cache 
entry that is the closest match to the extended URL of the 55 
request is used as the client base cache entry as if there were 
a match (block 208). In determining the closeness of the 
cache entries any number of method may be utilized includ- 
ing using the number of matches of data elements of the 
extended URL with the most matches being considered "the 60 
closest" match. If multiple entries have the same number of 
matches then the oldest entry could be updated. The CRC 
sent to the server-side intercept module 40 is then set to the 
CRC of this substitute cache entry (block 211 and block 
213). 65 

Block 213 illustrates the operations of sending the CGI 
request to the server-side intercept module 40 over the 



,087 

12 

external communication link. As reflected in block 213, the 
client-side intercept module 30 transmits the HTTP request 
and the request CRC which has either been set to null if no 
client base cache entry exists for the extended URL of the 
CGI request or has been set to the CRC of the client base 
cache entry if an entry does exist or the substitute entry as 
described above. Thus, the client-side intercept module has 
converted the CGI request to a client/server specific 
protocol, transmitted the client/server specific communica- 
tion over the external communication link to be received by 
the server-side intercept module 40. 

The actions taken by the server-side intercept module 
when a CGI request is received are reflected in FIG. 7. The 
receipt of the CGI request by the server-side intercept 
module 40 is shown in block 220. When the server-side 
intercept module 40 receives the CGI request, it saves a copy 
of the CRC value and the HTTP request. As seen in block 
221, the server-side intercept module 40 passes the HTTP 
request to the web server 20. 

When the server-side intercept module 40 receives a 
response to the HTTP request corresponding to the web 
browser originated communication or CGI request, the 
server-side intercept module 40 receives this response as an 
HTTP data stream as reflected in block 230 of FIG. 8. As 
seen in block 230, the server-side intercept module 40 saves 
the HTTP data stream and computes a CRC value for the 
HTTP data stream received from the web server 20. The 
server-side intercept module 40 also nulls the difference 
value to initialize the difference data. The server-side inter- 
cept module then determines if the response received as a 
web server originated communication is a response to a CGI 
request as shown in block 235. If the answer is no, then the 
"No" path out of block 235 of FIG. 8 is taken and the 
operations of block 236 are executed to send the HTTP data 
stream to the client-side intercept module. As reflected in 
block 236, this operation may involve the caching opera- 
tions described in the related applications discussed above. 
If the response received in block 230 is a response to a CGI 
request, then the "Yes" path out of block 235 is taken and the 
server-side intercept module then determines if a server base 
cache entry exists for the CGI response as reflected in block 
240. 

A server base cache entry may be created the first time the 
server-side intercept module 40 receives a response to a CGI 
request. In this instance the result of the conditional reflected 
in block 240 will cause the "No** path to be taken out of 
block 240. The server-side intercept module 40 will then 
determine if the creation of a server cache entry will exceed 
a predetermined maximum number of entries allowed for a 
given CGI URL (block 242). If the number is not exceeded, 
then the server-side intercept module 40 creates a server 
base cache entry corresponding to the CGI request by 
storing the extended URL for the CGI, the HTTP data stream 
for the response to the CGI request, and the CRC for the 
HTTP data stream. This operation is reflected in block 241. 
To be compatible with the coherent cache system described 
in the related applications discussed above, the server base 
cache entry may also include the Storage Date Time. As used 
herein, the term server CGI base form refers to the server 
base cache entry corresponding to the CGI request received 
from the web browser 10. 

If the creation of a server base cache entry would cause 
the number of cache entries associated with a CGI URL to 
be exceeded, then the server base cache entry with the 
closest extended URL will be selected as the server base 
form (block 243). In determining the closeness of the cache 
entries any number of methods may be utilized including 
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using the number of matches of data elements of the methods of determining if it would be more efficient to 

extended URL with the most matches being considered "the update the server base form such as are described in U.S. 

closest" match. The operations of the server-side intercept patent application Ser. No. 08/819,222 entitled METHOD 

module 40 may be carried out using the same operations AND APPARATUS FOR OPTIMAL REBASING OF WEB 

described below for differencing with a server CGI base 5 PAGE TRANSMISSION, the disclosure of which is incor- 

form except that the extended URL for the form will be the porated herein by reference as if set forth fully. Other 

closest extended URL rather than the actual extended URL methods of determining if the server base cache entry 

of the CGI request. corresponding to the CGI request should be updated may 

If a server base cache entry exists corresponding to the include time coherency such as that described in the related 

CGI request then the "Yes J1 path out of block 240 is taken. 10 applications discussed above or other methods known to 

Also, if the creation of a new cache entry would exceed the those with skill in the art to determine if the difference data 

maximum allowed number of base forms for a CGFs URL, has increased to such an extent that rebasing to create a new 

then the following operations are performed using the clos- base cache entry would improve system performance, 

est matched server cache entry as described above. The If rebasing of the server is required, then the server base 

server-side intercept module compares the CRC of the 15 cache entry is updated with the HTTP response data and the 

selected server base cache entry to the CRC of the response CRC of the HTTP data (block 255). Furthermore, the 

received from the web server 20. These operations are difference data may be nulled as is reflected in block 255. 

reflected in block 245 of FIG. 8. If the CRCs are the same, If rebasing of the server is not required, then the "No" 

then the server-side intercept module determines if the CRC path out of block 247 is taken and the server-side intercept 

for the server base cache entry corresponds to the CRC for 20 module 40 carries out the operations of block 250 to 

the client base cache entry. If these two CRC values are the determine if the CRC of the client base cache entry is the 

same, then the client base cache entry, the server base cache same as that of the selected server base cache entry. In other 

entry, and the response received from the web server 20 all words, is the server CGI base form identical to a client CGI 

contain the same HTTP data stream. The comparison of the base form which are the selected base cache entries of the 

server base cache entry to the client base cache entry is 25 server and the client which correspond to the particular CGI 

reflected in block 250. request of the web browser originated communication or 

If the two base cache entries are the same, then the which are the closest match to the extended URL of the CGI 

server-side intercept module need not send the base cache request. If the base forms are the same, then the client does 

entry to the client-side intercept module 30 and so, as not need to be rebased and the HTTP data stream informa- 

reflected in block 251, the HTTP data stream data to be 30 tion is nulled, as reflected in block 251. The server-side 

transferred to the client-side intercept module 30 is nulled. intercept module 40 then sends the difference response to the 

The server-side intercept module 40 then converts the HTTP client-side intercept module 30 by sending the CRC of the 

data stream received from the web server 20 to a client/ server base cache entry corresponding to the CGI request 

server specific communication protocol by transmitting the (i.e. the CRC of the server CGI base form), by sending the 

CRC of the HTTP data stream stored in the server base cache 35 nulled HTTP data stream which would correspond to the 

corresponding to the CGI request, the nulled HTTP data base data and by sending the difference data determined in 

stream data and the nulled difference data to indicate that the block 246. These operations are again reflected as block 252 

response to the CGI request was identical to the client base of FIG. 8. 

cache entry, as illustrated in block 252. If the server-side intercept module 40 determines that the 

Returning to block 245, if the CRC for the server base 40 CRCs are not the same for the client CGI base form and the 

cache entry corresponding to the CGI request or the selected server CGI base form, then the client needs to be rebased. 

server base cache entry is different than the CRC for the The client rebasing operation consists of sending the server 

response received from the web server in response to the CGI base form to the client-side intercept module 30. To 

CGI request originated by the web browser, then the "No" perform this operation, the server-side intercept module sets 

path out of block 245 is taken. The server-side intercept 45 the HTTP data stream data to be sent to the client-side 

module 40 then carries out the operations reflected in block intercept module 30 equal to the server CGI base form. This 

246. The server-side intercept module 40 compares the operation is reflected in block 253. The server-side intercept 
intercepted CGI response to the selected server base cache module 40 then converts the HTTP data stream received 
entry corresponding to the intercepted CGI request ("the from the web server to a client/server specific protocol by 
server CGI base form"). This comparison of the intercepted 50 sending the CRC of the server CGI base form, the HTTP 
CGI response to the server CGI base form provides CGI data stream data corresponding to the server CGI base form, 
difference data which corresponds to the difference between and sending the difference data between the CGI base form 
the intercepted CGI response and the server CGI base form. and the response received from the web server as seen in 

The differencing may be performed by any method known block 252. This information is then transmitted over the 

to those of skill in the art for determining the difference 55 external communication link 35 to the client-side intercept 

between a base form and a modified form. One method of module 30. 

differencing suitable for use in the present invention is The operations of the client-side intercept module upon 

described in "A Cross-Platform Binary Diff" by Coppieters, receipt of a response from the server-side intercept module 

Dr. Dobb's Journal, May 1995, pp. 32-36, the disclosure of 40 are shown in FIG. 6. The receipt of the response from the 

which is incorporated herein by reference as if set forth fully. 60 server-side intercept module 40 by the client -side intercept 

Other methods which may be used in determining the module 30 is reflected in block 260, As seen in block 265, 

difference data include those described in IBM Technical the client-side intercept module 30 determines if the 

Disclosure Bulletin, Vol. 22, No. 8A, January 1 980 which is response is a response to a CGI request. If the response is not 

also incorporated herein by reference as if set forth fully. to a CGI request, then the client-side intercept module 

The server-side intercept module 40 then determines if the 65 carries out the operations of block 267 which may include 

server CGI base form requires updating as reflected in block the cache operations described in the related applications 

247. This determination may be made by any number of discussed above. If, however, the response is to a CGI 
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request, then the "Yes" path out of block 265 is taken. The 
client-side intercept module 30 saves the HTTP data stream 
data, the difference data, and the CRC acquired from the 
client/server specific data stream transmitted over the exter- 
nal communication link. These operations are reflected in 
block 266 of FIG. 6. 

The client-side intercept module 30 then determines if a 
client base cache entry corresponding to the intercepted CGI 
request exists which would contain a client CGI base form. 
This interrogation is shown in block 270 and may be carried 
out by examining the extended URL of the HTTP request or 
HTTP response. If a client CGI base form exists, then the 
"Yes" path out of block 270 is taken. The client-side 
intercept module 30 then compares the CRC received over 
the external communication link to that of the CRC of the 
client CGI base form as shown in block 275. If they are 
different, then the "No" path of block 275 is taken and the 
client rebases by updating the CGI base form by replacing 
the client base cache entry corresponding to the URL of the 
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The present differencing techniques may also be applied 
to non-CGI data. In such an instance, the server-side inter- 
cept module 40 would need to keep multiple generations of 
server base cache entries to allow for the possibility that 
client-side intercept modules of web browsers connected to 
the web server may have different base forms. The server- 
side intercept module could then compare the CRC received 
from the client-side intercept module with the CRC of each 
of the prior generations of server base forms until a match 
was obtained. The server-side intercept module 40 may then 
optionally rebase the client-side intercept module 30 or 
simply provide the difference data to the client-side intercept 
module 30. Thus, the differencing methodologies described 
herein with respect to the CGI request may apply equally to 
any HTTP request and response. 

In the drawings and specification, there have been dis- 
closed typical preferred embodiments of the invention and, 
although specific terms are employed, they are used in a 
generic and descriptive sense only and not for purposes of 
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with the HTTP data stream data received over the external 
communication link 35 from the server side intercept mod- 
ule 40, The client base cache entry also is updated with 
respect to the CRC for the HTTP data stream. These 
operations are reflected in block 276 of FIG. 6. 

If the CRC received over the external communication link 
35 is the same as the CRC of the CGI base form, then the 
server-side intercept module server CGI base form is the 
same as the client-side intercept module client CGI base 
form and the "Yes" path out of block 275 is taken. 

Whether the base forms are the same or the client is 
rebased, the operations reflected in block 277 are carried out 
by the client-side intercept module 30. Block 277 reflects the 
client-side intercept module 30 reconstructing the HTTP 
data stream corresponding to the communication from the 
web server 20 from the client/server specific data stream 
received over the external communication link 35 by com- 
bining the client CGI base form with the CGI difference data 
received over the external communication link 35 to create 
an HTTP data stream corresponding to the intercepted CGI 40 
response. As seen in block 278, this response is then 
provided to the web browser 10 as an HTTP data stream. 

If no CGI base form exists in the client corresponding to 
the URL of the CGI request, then the "No" path out of block 
270 of FIG, 6 is taken. The client-side intercept module 30 45 
then determines if the creation of a new cache entry would 
exceed the maximum number of entries allowed for a URL 
of a CGI request (block 272) if such is the case, then the 
client base form is set to the client cache entry corresponding 
to the CRC which was sent with the request (blocks 208 and 
211 of FIG. 5). The client -side intercept module then carries 
out the operations discussed above as if a client base cache 
hit had occurred. 

If the creation of a new entry would not exceed the 
maximum number of allowed entries, then, as seen in block 
271, the client-side intercept module 30 creates a client base 
cache entry corresponding to the extended URL of the CGI 
request by storing the extended URL, the CRC of the HTTP 
data stream received over the external communication link 
from the server-side intercept module 40, and the actual 
HTTP data stream data. Storing this information creates a 
client base cache entry corresponding to the intercepted CGI 
request and thus creates a client CGI base form. The 
client-side intercept module may then carry out the opera- 
tions of block 277 by reconstructing the HTTP data stream 
by combining or merging the client CGI base form with the 
CGI difference data which may have been nulled. 
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following claims. 
That which is claimed is: 

1. A method of reducing the data transmitted over a 
communication link from a first application resident in a first 
computer and from a second application resident in a second 
computer wherein the data is transmitted over an external 
communication link between the first computer and the 
second computer, the method comprising: 
storing HTML data from the first application to be pro- 
vided to the second application in response to specific 
data in a CGI request from the second application in a 
cache resident in the first computer to create a server 
base cache entry; 
storing HTML data to be provided to the second appli- 
cation in response to the specific data in the CGI 
request from the second application in a cache resident 
in the second computer to create a client base cache 
entry; 

evaluating CGI requests and specific data associated with 
the CGI requests from the second application to deter- 
mine if a client base cache entry corresponding to the 
evaluated CGI request and specific data exists to pro- 
vide a client base form; 

interrogating CGI requests and specific data associated 
with the CGI requests from the second application to 
determine if a server base cache entry corresponding to 
the interrogated CGI request and specific data exists to 
provide a server base form; 

intercepting the data stream corresponding to the response 
HTML data from the first application in response to the 
interrogated CGI request and the specific data from the 
second application prior to transmission of the response 
HTML data on the external communication fink; 

comparing the intercepted response HTML data to the 
server base form to provide difference data correspond- 
ing to the difference between the intercepted response 
and the server base form; 

sending the difference data to the second computer over 
the external communication link; 

acquiring the difference data transmitted over the external 
communication link sent by the first computer; 

reconstructing the response HTML data corresponding to 
the communication from the first application from the 
client/server specific data stream received over the 
external communication link by combining the client 
base form with the difference data received over the 
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external communication link to create a response data 
stream corresponding to the intercepted response; and 
providing the reconstructed data stream corresponding to 
the intercepted response HTML data to the second 
application. 5 

2. The method according to claim 1, wherein said step of 
storing HTML data from the first application to create a 
server base cache entry and said step of storing HTML data 
to be provided to the second application to create a client 
base cache entry, store the Uniform Resource Locator (URL) io 
of the CGI request and the specific data of the CGI request. 

3. The method according to claim 2, wherein said step of 
evaluating CGI requests and specific data includes the step 
of determining if a client base cache entry has a URL and 
specific data corresponding to the evaluated CGI request; * 5 
and 

wherein said step of interrogating CGI requests and 
specific data includes the step of determining if a server 
base cache entry has a URL and specific data corre- 
sponding to the interrogated CGI request. 

4. The method according to claim 3, wherein said step of 
evaluating CGI requests and specific data associated with 
the CGI requests from the second application to determine 
if a client base cache entry corresponding to the evaluated 
CGI request and specific data exists to provide a client base 
form includes the step of selecting as the client base form the 
client base form with the closest match to the URL and 
specific data; and 

wherein said step of interrogating CGI requests and 
specific data associated with the CGI requests from the 
second application to determine if a server base cache 
entry corresponding to the interrogated CGI request 
and specific data exists to provide a server base form 
includes the step of selecting as the server base form the 
client base form with the closest match to the URL and 
specific data. 

5. The method according to claim 1, wherein said step of 
storing HTML data from the first application to create a 
server base cache entry and said step of storing HTML data 
to be provided to the second application to create a client 
base cache entry, store such information if the number of 
client cache entries and server cache entries corresponding 
to the CGI request is less than a predetermined maximum 
entry value. 

6. The method of claim 1 further comprising the steps of: 
determining if the server base form is identical to the 

client base form; 

wherein said sending step comprises transmitting the 
server base form and transmitting the difference data to 
the second computer over the external communication 
link if said determining step determines that the server 
base form is not identical to the client base form; 

wherein said reconstructing step comprises reconstructing 
the intercepted response data stream corresponding to 55 
the response from the first application by combining the 
server base form received over the external communi- 
cation link with the difference data received over the 
external communication link to create a data stream 
corresponding to the intercepted response; and eo 

updating the client base form corresponding to the inter- 
rogated request by storing the received server base 
form as the client base cache entry corresponding to the 
interrogated request. 

7. The method of claim 1 wherein the first application 65 
comprises a web server and the second application com- 
prises a web browser. 
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8. The method of claim 1 wherein the external commu- 
nication link comprises a wireless communication link. 

9. A method of generating a base form for a CGI response 
differencing communication system, the method compris- 
ing: 

storing the input elements of a CGI input form; 
storing the parameters of a request using the CGI input 
form; 

receiving the response to the request using the CGI input 
form; 

generating an extended cache key from the URL of the 
CGI request, the input elements of the CGI input form 
and the parameters of the CGI request using the CGI 
input form; and 

storing as an entry in a cache the response to the request 
using the extended cache key as a key to the cache 
entry. 

10. The method according to claim 9, wherein said step of 
generating an extended cache key comprises the steps of: 

generating an extension to a URL of the CGI request 
including non-default parameters of the request to 
provide an extension string; and 

appending to the URL of the CGI request the extension 
string to provide the extended cache key. 

11. The method according to claim 10, further comprising 
the step of excluding from the extension string input param- 
eters having a set of possible values greater than a predefined 
value. 

12. The method according to claim 10, further comprising 
the step of excluding from the extension string text element 
values. 

13. The method according to claim 9, further comprising 
the steps of: 

generating difference data between an entry in the cache 
having a first extended cache key and a subsequent 
response to a CGI request having the same extended 
cache key; and 

transmitting over an external communication link the 
extended cache key and the difference data. 

14. The method according to claim 9, further comprising 
the step of reconstructing a CGI response from difference 
data associated with the extended cache key and the cache 
entry associated with the extended cache key. 

15. An apparatus for reducing the data transmitted over a 
communication link from a first application resident in a first 
computer and to a second application resident in a second 
computer wherein the data is transmitted over an external 
communication link from the first computer to the second 
computer comprising: 

means for storing HTML data from the first application to 
be provided to the second application in response to 
specific data in a CGI request from the second appli- 
cation in a cache resident in the first computer to create 
a server base cache entry; 

means for storing HTML data to be provided to the 
second application in response to the specific data in 
the CGI request from the second application in a cache 
resident in the second computer to create a client base 
cache entry; 

means for evaluating CGI requests and specific data 
associated with the CGI requests from the second 
application to determine if a client base cache entry 
corresponding to the evaluated CGI request and spe- 
cific data exists to provide a client base form; 

means for interrogating CGI requests and specific data 
associated with the CGI requests from the second 
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application to determine if a server base cache entry 
corresponding to the interrogated CGI request and 
specific data exists to provide a server base form; 

means for intercepting the data stream corresponding to 
the response HTML data from the first application in 5 
response to the interrogated CGI request and the spe- 
cific data from the second application prior to trans- 
mission of the response HTML data on the external 
communication link; 

means for comparing the intercepted response HTML 10 
data to the server base form to provide difference data 
corresponding to the difference between the intercepted 
response and the server base form; 

means for sending the difference data to the second 
computer over the external communication link; 

means for acquiring the difference data transmitted over 
the external communication fink sent by the first com- 
puter; 

means for reconstructing the response HTML data corre- 
sponding to the communication from the first applica- 
tion from the client/server specific data stream received 
over the external communication link by combining the 
client base form with the difference data received over 
the external communication link to create a response 
data stream corresponding to the intercepted response; 
and 

means for providing the reconstructed data stream corre- 
sponding to the intercepted response HTML data to the 
second application. 

16. An apparatus for generating a base form for a CGI 
response differencing communication system comprising: 

means for storing the input elements of a CGI input form; 
means for storing the parameters of a request using the 

CGI input form; 
means for receiving the response to the request using the 

CGI input form; 
means for generating an extended cache key from the 

URL of the CGI request, the input elements of the CGI 

input form and the parameters of the CGI request using 

the CGI input form; and 
means for storing as an entry in a cache the response to the 

request using the extended cache key as a key to the 

cache entry. 

17. A computer program product for reducing the data 
transmitted over a communication link from a first applica- 
tion resident in a first computer and to a second application 
resident in a second computer wherein the data is transmit- 
ted over an external communication link from the first 
computer to the second computer, the computer program 
product comprising: 

a computer- readable storage medium having computer- 
readable program code means embodied in said 
medium, said computer-readable program code means 55 
comprising: 

computer-readable program code means for storing 
HTML data from the first application to be provided 
to the second application in response to specific data 
in a CGI request from the second application in a 
cache resident in the first computer to create a server 
base cache entry; 

computer-readable program code means for storing 
HTML data to be provided to the second application 
in response to the specific data in the CGI request 65 
from the second application in a cache resident in the 
second computer to create a client base cache entry; 
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computer-readable program code means for evaluating 
CGI requests and specific data associated with the 
CGI requests from the second application to deter- 
mine if a client base cache entry corresponding to the 
evaluated CGI request and specific data exists to 
provide a client base form; 

computer-readable program code means for interrogat- 
ing CGI requests and specific data associated with 
the CGI requests from the second application to 
determine if a server base cache entry corresponding 
to the interrogated CGI request and specific data 
exists to provide a server base form; 

computer-readable program code means for intercept- 
ing the data stream corresponding to the response 
HTML data from the first application in response to 
the interrogated CGI request and the specific data 
from the second application prior to transmission of 
the response HTML data on the external communi- 
cation link; 

computer- readable program code means for comparing 
the intercepted response HTML data to the server 
base form to provide difference data corresponding 
to the difference between the intercepted response 
and the server base form; 

computer- readable program code means for sending the 
difference data to the second computer over the 
external communication link; 

computer-readable program code means for acquiring 
the difference data transmitted over the external 
communication link sent by the first computer; 

computer- readable program code means for recon- 
structing the response HTML data corresponding to 
the communication from the first application from 
the client/server specific data stream received over 
the external communication link by combining the 
client base form with the difference data received 
over the external communication link to create a 
response data stream corresponding to the inter- 
cepted response; and 

computer-readable program code means for providing 
the reconstructed data stream corresponding to the 
intercepted response HTML data to the second appli- 
cation. 

18. A computer program product for generating a base 
form for a CGI response differencing communication 
system, the computer program product comprising: 

a computer-readable storage medium having computer- 
readable program code means embodied in said 
medium, said computer-readable program code means 
comprising: 

computer-readable program code means for storing the 
input elements of a CGI input form; 

computer-readable program code means for storing the 
parameters of a request using the CGI input form; 

computer-readable program code means for receiving 
the response to the request using the CGI input form; 

computer-readable program code means for generating 
an extended cache key from the URL of the CGI 
request, the input elements of the CGI input form and 
the parameters of the CGI request using the CGI 
input form; and 

computer-readable program code means for storing as 
an entry in a cache the response to the request using 
the extended cache key as a key to the cache entry. 
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